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About me: 


Working on Linux platforms since 2004, with a background on 
embedded devices. 


Contributed to Intel’s Graphics Performance Analyzers tools for 
Android OpenGLES applications 2011-2014. 


Joined Mesa in 2014, working on performance tools and 
automation. 


GPU Performance Analysis Workflow intel) 


e Investigate system bottlenecks first 
- top, gputop, rapl 


100% GPU utilization with lower CPU utilization indicates 
a GPU-bound workload 


TDP limited workloads cause GPU clock rate to fall. 
- MESA DEBUG=perf 


GPU Performance Analysis Workflow intel) 


e CPU Bound workloads have traditional tools 
- perf, callgrind, cachegrind, sysprof 
e GPU performance analysis has a sparse landscape of Linux tools 
- AMD GPU PerfStudio, Nvidia Linux Graphics Debugger, QApiTrace 


- Leverage GPU hardware counters to quantify the cost of asynchronous 
GPU operations. 


- Live experimentation to see the effect on performance. 


- Deeply investigate a graphics workload. 


GPU Tools stumbling blocks (intel) 


e Generally hardware-specific 

e Mostly closed source 

e Linux support is an afterthought 
e Tracing/retracing not reliable 


e Low numbers of users 


e Mesa support for GPU performance counters 


intel) 


FrameRetrace: frame analysis based on Apilrace 


e Widely used and high quality trace/retrace 
°. https://github.com/janesma/apitrace 
e Cross-platform: Linux and Windows 


e Upstream GPU Counter support in Mesa and Kernel for Haswell and 
later. 


e Leveraged by Intel Mesa team to identify and fix several 
performance issues in 1965. 


FrameRetrace: frame analysis based on Apilrace 


e GPU Metrics for each render 

e Render target visualization and experiments 
e Api log 

e Batch disassembly 

e Shader analysis, live editing, and assembly 


e Uniform constant display and live editing 


e Render experiments 
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Other features 


intel) 


e Windows support provides important leverage for open source driver teams seeking to 
find Mesa performance gaps. 


e Proposed features: 


Display and modify GL State 

Display texture state, with mip clamp experiment 
Display geometry mesh 

Depth buffer visualization 

Overdraw / hotspot rendertarget visualization 


Ul improvements 


Support for more hardware 


Caveats 


e Currently a one-person side project, with help 


- Thanks to Laura Ekstrand, Robert Bragg, Lionel Landerwelin, Eero 
Taminen, Pekka Jylha-Ollila 


e Experiments require intricate state tracking 


e Some workloads do not have single-frame run loops 
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Questions? 


